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Abstract 

The Local Search algorithm (or Hill Climbing, or Iterative Improvement) is one of the simplest 
heuristics to solve the Satisfiability and Max-Satisfiability problems. It is a part of many satisfiability 
and max- satisfiability solvers, where it is used to find a good starting point for a more sophisticated 
heuristics, and to improve a candidate solution. In this paper we give an analysis of Local Search on 
random planted 3-CNF formulas. We show that if there is k < | such that the clause-to-variable ratio is 
less than k In n (n is the number of variables in a CNF) then Local Search whp does not find a satisfying 
assignment, and if there is k > | such that the clause-to-variable ratio is greater than k In n then the local 
search whp finds a satisfying assignment. As a byproduct we also show that for any constant g there is 7 
such that Local Search applied to a random (not necessarily planted) 3-CNF with clause-to-variable ratio 
g produces an assignment that satisfies at least jn clauses less than the maximal number of satisfiable 
clauses. 

1 Introduction 

A CNF formula over variables xi, ■ ■ ■ , Xn is a conjunction of clauses ci, . . . , Cm where each clause is a 
disjunction of one or more literals. A formula is said to be a A;-CNF if every clause contains exactly k 
literals. In the problem fc-SAT the question is, given a A;-CNF, decide if it has a satisfying assignment (find 
such an assignment for the search problem). In the MAX-/c-SAT problem the goal is to find an assigiunent 
that satisfies as many clauses as possible. The problem A;-SAT for A; > 3 is one of the first problems proved 
to be NP-complete problems and serves as a model problem for many algorithm and complexity concepts 
since then. In particular, Hastad ifTTl proved that the MAX-/c-SAT problem is NP-hard to approximate within 
ratio better than 7/8. These worst case hardness results motivate the study of the typical case complexity of 
those problems, and a quest for probabilistic or heuristic algorithms with satisfactory performance, in the 
typical case. In this paper we analyze the performance of one of the simplest algorithms for (MAX-)/c-SAT, 
the Local Search algorithm, on random planted instances. 

The distribution. Let us start with planted instances. One of the most natural and well studied probability 
distributions on the set of 3-CNFs is the uniform distribution <l>(n, m(n)) on the set of 3-CNFs with a given 
clauses-to-variables ratio |[T4l . It can be constructed and sampled as follows. Fix the number m = m(n) of 
3-clauses as a function of the number n of variables. The elements of <I>(n, m{n)) are 3-CNFs generated by 
selecting m = m{n) clauses over variables xi, . . . , Xn- Clauses are chosen uniformly at random from the 
set of possible clauses, and so the probability of every 3-CNF from $(n, m{n)) is the same. An important 
parameter of such CNFs is the clause-to-variable ratio, — , or density of the formula. We will use the density 
of a 3-CNF rather than the number of clauses, and so we write <I>(n, gn) instead of <^(n, m{n)). Density 
can also be a function of n. 
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However, the typical case complexity for this distribution is not very interesting except for a very nar- 
row range of densities. The reason is that the random 3-SAT under this distribution demonstrates a sharp 
satisfiability threshold in the density [2|. A random 3-CNF with density below the threshold (estimated to 
be around 4.2) is satisfiable whp (with high probability, meaning that the probability tends to 1 as n goes to 
infinity), and a 3-CNF with density above the threshold is unsatisfiable whp. Therefore the trivial algorithm 
outputting yes or no by just counting the density of a 3-CNF gives a right answer to 3-SAT whp. For more 
results on the threshold see ifTOlfTTl fTlfTSll. It is also known that, as the density grows, the number of clauses 
satisfied by a random assignment differs less and less from the maximal number of satisfiable clauses. If 
density is infinite (meaning it is an unbounded function of n), then whp this difference becomes negligible, 
i.e. o{n). Therefore, distribution <I>(n, gn) is not very interesting for MAX-3-SAT, at least when density is 
large, as one can get whp a very good approximation just by checking a random assignment. 

A more interesting and useful distribution is obtained from <I>(n, gn) by conditioning on satisfiability: 
such distribution is uniform and its elements are the satisfiable 3-CNFs. Then the problem is to find or ap- 
proximate a satisfying assignment knowing it exists. Unfortunately, to date there are no techniques to tackle 
such problems (see, e.g., [6, 9|), particularly, to sample the satisfiable distribution. A good approximation 
for such a distribution is the planted distribution <I>P'^"*(n, gn), which is obtained from <I>(n, gn) by condi- 
tioning on satisfiability by a specific "planted" assignment. To construct an element of a planted distribution 
we select an assignment of a set of n variables and then uniformly at random include gn clauses satisfied by 
the assignment selected. Some attempts have been made to define a better approximation of the satisfiable 
distribution, see, e.g. ||20l . however, the analysis of such distributions is difficult and it is not clear if they 
are closer to the distribution sought. 

Another interesting feature of the planted distribution is that there is a hope that it is possible to design 
an algorithm that solves all planted instances whp. Some candidate algorithms were suggested in l6l lT3ll2Tl . 
Algorithm from [ 13 ] and [ ,21] use different approaches to solve planted 3-SAT of high density. Experiments 
show that the algorithm from lH achieves the goal, but a rigorous analysis of this algorithm is not yet made. 
For a wider survey on SAT algorithms the reader is referred to |[23i r7l. 

The algorithm. The Local Search algorithm (LS) is one of the oldest heuristics for SAT that has been 
around since the eighties. Numerous variations of this method have been proposed since then, see, e.g., |[T5l 
l25l . We study one of the most basic versions of LS, which, given a CNF, starts with a random assignment 
to its variables, and then on each step chooses at random a variable such that flipping this variable increases 
the number of satisfied clauses, or stops if such a variable does not exist. Thus LS finds a random local 
optimum accessible from the initial assignment. 

LS has been studied before. The worst-case performance of pure LS is not very good: the only known 
lower bound for local optima of a /c-CNF is of clauses satisfied, where m is the number of all clauses 

|[16|. In [19 1, it is shown that if density of 3-CNFs is linear, that is, m = Q{n'^), then LS solves whp a 
random planted instance. Finally, in |[8l, we gave an estimation of the dependence of the number of clauses 
LS typically satisfies and the density of the formula. 

Often visualization of the number of clauses satisfied by an assignment is useful: Assignments can 
be thought of as points of a landscape, and the elevation of a point corresponds to the number of clauses 
unsatisfied, the higher the point is, the less clauses it satisfies. It is suspected that 'topographic' properties 
of such a landscape are responsible for many complexity properties of satisfiability instances. For example, 
it is believed that the hardness of random CNFs whose density is close to the satisfiability threshold is 
due to the geometry of the satisfying assignments. They tend to concentrate around several centers, that 
make converging to a solution more difficult ^ |22l. As we shall see the performance of LS is closely 
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related to geometric properties of the assignments, and so we hope that the study of LS may lead to a better 
understanding of those properties. 

The behavior of other SAT/MAXSAT algorithms have been studied before. For example, the random 
walk has been analyzed in 1,24,1 and then in 111. A message passing type algorithm. Warning Propagation, is 
studied in 11121 . 

Our contribution. We classify the performance of LS for all densities higher than an arbitrary constant. 
In particular, we demonstrate that LS has a threshold in its performance. The main result is the following 
theorem. 

Theorem 1 (1) Let q> k • Inn, and k > |. Then the local search whp finds a solution of an instance from 
^>P'^"t(n,£in). 

(2) Let c < g < K ■ Inn, a a constant, and k < g. Then the local search whp does not find a solution of an 
instance from <I>p'^"*(?i, gn). 

To prove part (1) of the theorem[T]we show that under those conditions all the local optima of a 3-CNF 
whp are either satisfying assignments, that is, global optima, or obtained by flipping almost all the values 
of planted solution, and so are located on the opposite side of the set of assignments. In the former case LS 
finds a satisfying assignment, while whp it does not reach the local optima of the second type. We also show 
that that for any constant density g there is 7 such that the assignment produced by LS on an instance from 
$P'^"*(n, gn) or <I>(n, gn) satisfies at least jn clauses less than the maximal number of satisfiable clauses. 
Unfortunately, it is somewhat difficult to run computational experiments on CNFs of infinite density, as in 
order to have log n sufficiently large n must be prohibitively big. However, experiments we were able to 
conduct agree with the results. 

Another region where LS can find a solution of the random planted 3-CNF is the case of very low 
density. Methods similar to Lemma |9] and Theorem [TT] show that this low density transition happens around 
g M n~^/^. However, we do not go into details here. 

Usually the main difficulty of analysis of algorithms for random SAT is to show that as an algorithm 
runs, some kind of randomness of the current assignment is kept. This property allows one to use 'card 
games', Wormald's theorem, and differential equations as in [1, 8], or relatively simple probabilistic con- 
structions, such as martingales, as in |3 |. For LS randomness cannot be assumed after just a few iterations 
of the algorithm, which makes its analysis more difficult. This is why the most difficult part of the proof 
is to identify to which extent assignments produced by LS as it runs remain random, while most of the 
probabilistic computations are fairly standard. 

The paper is organized as follows. After giving several necessary definitions in Section 2, we prove in 
Section 3, that above the threshold established in Theorem [U planted 3-CNFs do not have local optima that 
can be found by LS, other than satisfying assignments. In Section 4 we show that below the threshold there 
are many such optima, and that LS necessarily gets stuck into one of them. 

2 Preliminaries 

SAT. A 3-CNF is a conjunction of 3-clauses. As we consider only 3-CNFs, we will always call them just 
clauses. Depending on the number of negated literals, we distinguish 4 types of clauses: (—,—,—),(+,—,—), 
(+, +, — ), and (+, +, +). If 99 is a 3-CNF over variables xi, . . . , x„, an assignment of these variables is 
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Input: 3-SAT formula ip over variables xi, . . . , x„. 
Output: Boolean n-tuple v, which is a local minimum of ip. 
Algorithm: 

choose uniformly at random a Boolean n-tuple u 

let U be the set of all variables xi such that the number of clauses that can be made satisfied 

by flipping the value of xi is strictly greater than the number of those made unsatisfied 
while U is not empty 

pick uniformly at random a variable Xj from U 

change the value of Xj 

recompute U 



a Boolean n-tuple u = (ui, . . . , Un), so the value of Xi is Uj. The density of a 3-CNF tp is the number ^ 
where m is the number of clauses, and n is the number of variables in tp. 

The uniform distribution of 3-CNFs of density g (density may be a function of n), <^(n, gn) is the set 
of all 3-CNFs containing n variables and gn clauses equipped with the uniform probability distribution on 
this set. To sample a 3-CNF accordingly to $(n, gn) one chooses uniformly and independently gn clauses 



within a clause. Random 3-SAT is the problem of deciding the satisfiability of a 3-CNF randomly sampled 
accordingly to <I>(n, gn). For short, we will call such a random formula a 3-CNF from ^{n, gn). 

The uniform planted distribution of 3-CNF of density g is constructed as follows. First, choose at random 
a Boolean n-tuple u, a planted satisfying assignment. Then let <I>P'^"*(n, gn, u) be the uniform probability 
distribution over the set of all 3-CNFs over variables xi, . . . , x„ with density g and such that u is a satisfying 
assignment. For our goals we can always assume that u is the all-ones tuple, that is a 3-CNF belongs to 
<I>P'^"*(n, gn, u) if and only if it contains no clauses of the type (— , — , — ). We also simplify the notation 
$P'^"*(n, gn, u) by <l>P'^"*(n, gn). To sample a 3-CNF accordingly to <I>P'^"*(n, gn) one chooses uniformly 

and independentiy gn clauses out of 7 (^^^ possible clauses of types (+, — , — ), (+, +, — ), and (+, +, +). 

Random Planted 3-SAT is the problem of deciding the satisfiability of a 3-CNF from $P'^"*(n, gn). 

The problems Random MAX-3-SAT and Random Planted MAX-3-SAT are the optimization versions of 
Random 3-SAT and Random Planted 3-SAT. The goal in these problems is to find an assignment that satisfies 
as many clauses as possible. Although the two problems usually are treated as maximization problems, it 
will be convenient for us to consider them as problems of minimizing the number of unsatisfied clauses. 
Since we always evaluate the absolute error of our algorithms, not the relative one, such transformation does 
not affect the results. 

Local search. A formal description of the Local Search algorithm (LS) is given in Fig.[T] Observe that LS 
stops when reaches a local minimum of the number of unsatisfied clauses. 

Given an assignment u and a clause c it will be convenient to say that c votes for a variable Xj to have 
value 1 if c contains literal Xi and its other two literals are unsatisfied. In other words if either (a) u assigns 
Xi to 0, c is not satisfied by u, and it will be satisfied if the value of Xi is changed, or (b) the only literal in 
c satisfied by u is Xj. Similarly, we say that c votes for Xj if c contains the negation of Xj and its other two 



Figure 1: Local Search 




Thus, we allow repetitions of clauses, but not repetitions of variables 
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literals are not satisfied. Using this terminology we can define set U as the set of all variables such that the 
number of votes received to change the current value is greater than the number of those to keep it. 



Random graphs. Probabilistic tools we use are fairly standard and can be found in the book f^. 

Let 9? be a 3-CNF with variables xi, . . . , Xn- The primal graph G{ip) of is the graph with vertex 
set {xi, . . . , Xn} and edge set {xiXj \ literals containing Xi, Xj appear in the same clause}. The hypergraph 
H{(p) associated with ip is a. hypergraph, whose vertices are the variables of if and the edges are the 3- 
element sets of variables belonging to the same clause. Note that if 93 € <I>P'^"*(n, gn), then H{ip) is a 
random 3-hypergraph with n vertices and gn edges, but G{n) is not a random graph. 

We will need the following properties that a graph G{ip) of not too high density has. 

Lemma 2 Let g < nlnnfor a certain constant k, and let Lp G <I)P'^"*(n, gn). 

(1) For any a < 1, whp all the subgraphs of G{(p) induced by at most 0{n") vertices have the average 
degree less than 5. 

(2) The probability that G{(p) has a vertex of degree greater than \r? n is o(n~'^). 

Proof: (1) This part of the lemma is very similar to Proposition 13 from [12], and is proved in a similar 
way. Let 5 be a fixed set of variables with \U\ = £. The number of 3-element sets of variables that include 
2 variables from U is bounded from above by 

(n - 2) < ^fn. 

For each of them the probability that this set is the set of variables of one of the random clauses chosen for 
if (we ignore the type of the clause) equals 

KTi In n 6k In n 



(n-l)(n-2) 

3, 



Thus, the probability that 2i of them are included as clauses is at most 



2£ J \{n- l)(n - 2) J - \ n 

Let d = e(3eK)^. Using the union bound, the probability that there exists a required set U with at most n'^ 
variables is at most 

21 




(dn^-Mn^n 



1 — dn" 1 In n 
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(2) The probability that the degree of a fixed vertex is at least In^ n is bounded from above by 

1 Y"^ n / In n\ _ n ( 3eKn In n V'^ ^ _ / Se/t V"^ " 
n) \\n^n)-'' ) ~ ' 

where 

^-In n 

the probability that some particular In^ n random clauses include x, and i '^'^"^^^^ j 
number of In^ n-element sets of clauses. Then it is not hard to see that 



\lnn/ ' 

as n goes to infinity. □ 

Several times we need the following corollary from Azuma's inequality for supermartingales (see Lemma 1 
from 1 26 1). 

Observation 3 (1) Let Yf be a supermartingale such that E < Yt '^nd — Yt\< cfor some 

c. Then P {Yt - Fq > be) < e'-t Jor any b > 0. 

(2) This inequality implies that ifEi {YtJ^i\Yt) < Yt — d and [Fi+i — Yt\ < c < 1 then the process 
Zt = Yf — dt is a supermartingale and we have the following inequality 

(/ dtW (b+dt)'^ , , 

Zt-Zo<ib + ^]]<e-^^<e-'"'. (1) 

The following lemma is a simple corollary of Chemoff bound. 

Lemma 4 Let r, s be integers, < 1 a positive real, and let ai, . . . ,ar, Pi, ■ ■ ■ , f3s be some real constants. 
There are constants A and C such that we have 

V{X>Y)< Ce^^^(^) (2) 

r s 

for any random variables X and Y such that E [X) < 6E (Y) and X = otiXi, Y = ^ [3iYifor some 

j=0 i=0 

binomial random variables Xi, . . . , X^, Yi, . . . , Fg. 

Proof: Let £ = , , s t-^—t — \ tttty- It is easy to see that event X > Y implies occurrence of at 

least one of the events from the set 

S = {{Xi > E (X,) + £E (y)}ie{o,...,r}, {Yi < E (y,) - £E {Y)},^{o,...,s}}- 

Indeed, inequality X <Y can be derived from inequalities, opposite to the ones in S and E (X) < OE (Y). 
Application of Chernoff bound gives us inequalities 

2/ E(y) 



P(|X-E(X,)|>eE(y)) < e'^'^'^'^^A^j < e-«^E(y)e-73 



p(|y-E(y,)l>eE(y)) < e "'^'''^^'t^SV' < e-^'^m/^. 

Thus if we set A = £^/3, C = r + s then using union bound we can conclude that inequality ^ holds. □ 
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3 Success of Local Search 



In this section we prove the first statement of the Theorem [J 1). This will be done as follows. First, we 
show that if a 3-CNF has high density, that is, greater than k log n for some k > | then whp all the local 
minima that do not satisfy the CNF — we call such minima proper — concentrate very far from the planted 
assignment. This is the statement of Proposition [8]below. Then we use Lemma[5]to prove that starting from 
a random assignment LS whp does not go to that remote region. Therefore the algorithm does not get stuck 
to a local minimum that is not a solution. 

Several times we will need the following observation that can be checked using the inequality 

< (x)^- ^"^y ^'^'^ ^ ^^^^ < a < 1 

\ ^ Q)7n" Inn— 7n" ln7+7n°' z-^") 

We need the following two lemmas. Recall that the planted solution is the all-ones one. 

Lemma 5 Let q > uln n for some constant k, and let constants qo,qi be such that qo < qi. Whp any 
assignment with q^n zeros satisfies more clauses than any assignment with qin zeros. 

Proof: Let u,vbe. some vectors with qon and qin zeros, respectively. Let c be a random clause, then 
(1) with probability I all its literals are positive, (2) with probability | two literals are positive and similar 
(3) with probability | one literal is positive. The probabilities that the clause is satisfied by u in these cases 
are {I — qo)^ , {1 — qo)"^ qo and (1 — <?o)9o' respectively. Hence the total probability of a clause to be satisfied 
by u equals (-^ ''o) +^(^ io) go+3(i 1510)90 _ l_£o ^ similar result holds for v. Thus the expectation of the 

Y n'^ 1 

number of clauses satisfied by u and v in a random formula equals -^j^nn In n and —^kti In n respectively, 
thus applying lemma |4] we conclude that 

P n satisfies less than i^^^^af^/tn Inn clauses) < e"^'"!"^", 

for some A' > 0. There are 2" assignments, hence, apphcation of the union bound finishes proof of the 
lemma. □ 



Lemma 6 Let g > Klnnfor some k (not necessarily > |j. There is a < 1 such that for G gn) 
whp for any proper local minimum uofip the number of variables assigned to by u is either less than n", 
or greater than j^. 

Proof: Let M, \M\ = £ be the set of all variables that u assigns to 0. Let B'j^f^ be event "for every 
Xj € M the number of clauses voting for Xi to be 1 is less than or equal to the number of clauses voting for 
Xi to be 0". Since u is a local minimum, B'j^^'^^ is the case for u. It is easy to see that event B^j^'^^ implies 
event B'^^ = "the total number of votes given by clauses for variables in M to be 1 is less than or equal to 
the total number of votes given by clauses for variables in M to be 0". To bound the probability of B^j^'^^ 
we will bound the probability of Bf}. 

Let c be a random clause. It can contribute from to 3 votes for variables in M to be one and or 1 
vote for them to remain zero. Let us compute, for example, the probability that it contributes exactly two 
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votes for variables in M to become one. It happens if c is of type (+,+,—), both its positive variables 
are in M and the negative variable is outside of M. Probability of this event is — £/n). So the 

expectation of the number of clauses voting for exactly 2 variables in M to be 1 is j£'^n~^{l — £/n)Klnn. 
The expectations of the numbers of clauses voting for three and one variables to be 1 are ^£^n~'^K\nn and 
|(1 — Inn, respectively. 

A clause votes for a variable in M to remain if its type is (+, — , — ), one of its negative literals 
is not in M, and two other literals are in M, or if its type is (+,+,—) and all the variables in it be- 
long to M. Thus the expectation of the number of clauses voting for variables in M to remain is 
'^K\nn{2£'^n-^{l-£/n)+£^n-^). 

Hence the expectation of the number of votes for variables in M to flip equals 

/ 1 3 3 

E (votes for a flip) = k Inn x ( 3 • -^^n"^ + 2 • -£'^n-^{l - £/n) + 1 • -£{1 - £/nf 

and expectation of the number of votes for variables in M to remain equals 

3 



E (votes for status quo) = Klnn x (^£^n ^(1 



n) + -£^n-'^ 



\f£ < j^n then 



E (votes for status quo) Q£{n - £) + 3£'^ 3(n 



E (votes for a flip) 6£{n - £) + 3£'^ + 3(n - £)2 6£{n - £) + 3£^ + 3(n - £f 

^ ^ 3 • Yggra^ ^ ^ ^_ 

12n2 400' 



Therefore we can apply Lemma |4] to the votes for and against Os and get the following bound P (B'^l) < 
g-AE(votes for a flip) some A > 0. Then we can bound number of votes for a flip from below by 6£ In n 
for some constant 6 and we can bound the number of sets M of size £ as 



n 



#(Mof size£) = < = e^i-(«/^)+^ 

Therefore if 

£ln{n/£) + £< 6£lnn 

then union bound implies that whp there is no set M such that Bfl happens. It is easy to see that for £ > 
and a that is close enough to 1 the above inequality holds, which finishes the proof of the lemma. □ 



Now suppose that n is a proper local minimum of 99 G <^P'^"*(n, gn). There is a clause c € (/? that is not 
satisfied by u. Without loss of generality, let the variables in c be xi, 3:2, 3:3, and let the variable assigned 
be xi. Thus, clause c votes for xi to be flipped to 1. Since -u is a local minimum there must a clause that is 
satisfied, that becomes unsatisfied should xi flipped. We call such a clause a support clause for the value 
of xi. In any support clause the supported variable is negated, and therefore any support clause has the type 
(+,—,—) or (+,+, —). A variable of a CNF is called k-isolated if it appears positively in at most k clauses 
of the type (+, — , — ). The distance between variables of a CNF (/? is the length of the shortest path in G{if) 
connecting them. 
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Lemma 7 If k > ^ and g > Klnn then for any integers di,d2 > 1 and for a random ip € gn) 
whp there are no two di-isolated variables within distance d2from each other. 

Proof: Let x be some variable. The probability that it is di -isolated can be computed as 

, \ / o \ Kn\nn—di 

Knmn\ I o ^ 



P(xisdi-isolated) = d^ ■ [ ^] [l 



2 \ nn Inn/ 3\ '^^/7 ^ ^'^^ 



~ *0 -I;) "(y 

for any e > 0. 

By Lemma [2t2), the degree of every vertex of G{(p) whp does not exceed \t? n. Hence, there are at 
most In^'^^ n vertices at distance d2 from x. Applying the union bound we can estimate the probability that 
there is a di-isolated vertex at distance d2 from x as 0{\t?'^'^ n ■ n~T'^). Finally, taking into account the 
probability that x itself is di-isolated, and applying the union bound over all vertices of G{lp) we obtain 
that the probabiUty that two di -isolated vertices exists at distance d2 from each other can be bounded from 
above by 

n • 0(n-^) • 0{\n^'^^ n ■ n-l") = 0{ln^'^^ n ■ v}'^'^). 
Thus for K > X whp there are no two such vertices. □ 



Proposition 8 Let g > n - Inn, and k > ^. Then whp proper local minima of a 3 -CNF from <^P'^"*(n, gn) 
have at most ^ ones. 

Proof: Let tp € <I>P'^"*(n, gn) be a random planted instance. Suppose that n is a proper local minimum 
that has more than ones. We use the following observation. Let c be a clause not satisfied by u. Then 
it contains at least one variable Xi that is assigned to zero by u. The assignment n is a local minimum, so 
there must be a clause c' that is satisfied only by xi. Hence, c' is a support clause, and contains a variable 
Xj which is assigned to zero by u. Variables xi and Xj are at distance 1. Setting di = 11 and d2 = 1, by 
Lemma |2l we conclude that one of them is not 11 -isolated. 

Set di = 11, (i2 = 3 and consider the set Z of all variables assigned to zero by n that are not 11 -isolated. 
By the observation above this set is non-empty. On the other hand, by Lemma [6l is 0(n") for some 
a < 1. Consider x G Z. It appears positively in at least 10 clauses of the type (+, — , — ). Each of these 
clauses is either unsatisfied or contains a variable assigned to 0. Suppose there are k unsatisfied clauses 
among them. Since u is a local minimum, to prevent x from flipping, x must be supported by at least k 
support clauses, each of which contains a variable assigned to 0. Thus, at least 6 neighbors of x in G{ip) are 
assigned to 0. Any two neighbors of x are at distance 2. By Lemma [V] at least 5 of the neighbors assigned 
to are not 1 1-isolated, and therefore belong to Z. Thus the subgraph induced by Z in G{ip) has the average 
degree greater than 5, which is not possible by Lemma|2tl). □ 

Now we are in a position to prove statement (1) of Theorem [T] 
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Figure 2: Caps and crowns 



Proof: [of Theorem [TJl)] By Lemma|5]for a 99 € gn) whp any assignment with dn variables 

equal to 1, where ^ < d < |, satisfies more clauses than any assignment with equal to 1. Then, whp a 
random initial assignment for LS assigns between ^ and | of all variables to 1 . Therefore, whp LS never 
arrives to a proper local minimum with less than ^ variables equal to 1, and, by Proposition [H to any proper 
local minimum. □ 



4 Failure of Local Search 

We now prove statement (2) of Theorem [T] The overall strategy is the following. First, we show. Propo- 
sition [lOl that in contrast to the previous case there are many proper local minima in the close proximity 
of the planted assignment. Then we show. Proposition [T2j that those local minima are located so that they 
intercept almost every run of LS, and thus almost every run is unsuccessful. 

We start off with a technical lemma. A pair of clauses ci = (xi, X2, X3), C2 = (xi, X4, X5) is called a 
cap if xi, X5 are 1-isolated, that is they do not appear in any clause of the type (+, — , — ) except for ci and 
C2, respectively, and X2, X3 are not 0-isolated (see Figure|3a)). We denote equality f{n) = g{n){l + o(n)) 
by /(n) ~ gin). 

— - 7 
Lemma 9 Let n 4 < ^ < ^ • In n, and k < ^. There is a, ^ < a < 1, such that whp a random planted 

CNF Lp S <I>P'^"*(n, qn) contains at least n" caps. 

Proof: The proof is fairly standard, see, e.g. the proof of Theorem 4.4.4 in [T]. We use the second 
moment method. The result follows from the fact that a cap has properties similar to the properties of 
strictly balanced graphs, see |4|. Take some n, and let X be a random variable equal to the number of caps 
in a 3-CNF Lp G <I>P'^"*(n, gn). Straightforward calculation shows that the probability that a fixed 5-tuple of 
variables is a cap is ~ g^n~'^~7ii^. Therefore E (X) ~ g^n^~7ii^. 

Let S'be a fixed 5-tuple of variables, say, S = (xi, X2, X3, X4, X5), and ^4^ denote the event that S forms 
a cap. For any other 5-tuple T, the similar event is denoted by At, and we write At x As if these two 
events are not independent. By Corollary 4.3.5 of H it suffices to show that 

A* = ^P{AT\As)=o{BiX)). 

Let T = (y 1 , 2/2 5 2/3 1 2/4 1 2/5 ) ■ It is not hard to see that the only cases when At and ^4^ are not independent 
and the probability P (At \ As) is significantly different from is: yi = xi and {1/2, Us} = {x2, or 
yi = X5 and {2/2,2/3} = {xi,Xi}, ory^ = xi and {2/1,2/4} = {x2,X3}, or 2/5 = X5 and {yi, 2/4} = {xi,X4}. 
Then, as before, it can be found that in each of these cases P (At \ As) = 0{g'^n~'^~7i^). 
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Finally, 

A* = ^ P (^T I As) = n^P {At \ As) = v? ■ 0(/n" 



7 In n 



.3 g 



= 0(/n-^i#^) = o(E(X)). 
We can choose a = 1 — |k if > 1, and a = l — 4i/ifl>^p> for f < \. □ 



Proposition 10 Let g < k • Inn, and k < |. Then there is a, ^ < a < 1, such that a 3 -CNF from 
<I)P'^"*(n, gn) whp has at least proper local minima. 

Proof: Let ci = (xi, X2, X3), 02 = (^1,^4, 3^5) be a cap and u an assignment such that ^3 = ^5 = 0, 
and Ui = 1 for all other i. It is straightforward that u is a proper local minimum. By Lemma |9l there is a 
such that whp the number of such minima is at least n". □ 



Before proving Proposition [T2l we note that a construction similar to caps helps evaluate the approx- 
imation rate of the local search in the case of constant density on planted and also on arbitrary CNFs. A 
subformula c = (xi, X2, X3), ci = (xi, X4, X5), C2 = (x2, xe, X7), C3 = (X3, xs, xg) is called a crown if 
the variables xi, . . . , xg do not appear in any clauses other than c, ci, C2, C3 (see Fig. I3b)). The crown is 
satisfiable, but the all-zero assignment is a proper local minimum. For a CNF ip and an assignment u to its 
variables, by 0PT((/9) and sat(u) we denote the maximal number of simultaneously satisfiable clauses and 
the number of clauses satisfied by u, respectively. 

Theorem 11 If density g is such that n^'^ < g < ulnnfor some v < 1/4 and k < 1/27, then there is 
7g, = such that whp Local Search on a 3-CNF ip € <I>(n, gn) fv? € <I>P'^"*(n, gn)) returns an assignment 
u such that OPT{ip) — sat{u) > 'jig) ■ n, where OPT(ip) denotes the maximal number of clauses in ip that 
can be simultaneously satisfied and sat{u) denotes the number of clauses satisfied by u. 
If g is constant then 7^ is also constant. 

Proof: As in the proof of Lemma|9l it can be shown that for g that satisfies conditions of this theorem 
there is 7' = such that whp a random [random planted] formula has at least 7'n crowns. If ^> is a 

constant, 7' is also a constant. For a random assignment u, whp the variables of at least j^n crowns are 
assigned zeroes. Such an all-zero assignment of a crown cannot be changed by the local search. □ 

Then we move on to proving Proposition [T2l 

Proposition 12 Let g < k - Inn, and k < ^. The local search on a 3-CNF from $P'^"*(n, £-n) whp ends up 
in a proper local minimum. 

If g = o(ln n) then Proposition [12] follows from Theorem [TT] So in what follows we assume that 
g > k' • Inn. The main tool of proving Proposition [T2| is coupling of local search (LS) with the algorithm 
Straight Descent (SD) that on each step chooses at random a variable assigned to and changes its 
value to 1. Obviously SD is not a practical algorithm, since to apply it we need to know the solution. For the 
purposes of our analysis we modify SD as follows. At each step SD chooses a variable at random, and if it 
is assigned changes its value (see Fig.Ua)). The algorithm LS is modified in a similar way (see Fig.lHb)). 

It is easy to see that the vector obtained by SD at step t does not depend on the formula. And since SD 
treats all variables equally we can make the following 
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Input: (f e <^>P'^"*(ra, gn) with the all-ones solution, 

Boolean tuple u, 
Output: The all-ones Boolean tuple. 

Algorithm: 

while there is a variable assigned 
pick uniformly at random variable Xj from 

the set of all variables 
if Uj = then set Uj = 1 

(a) 



Input: 3-SAT formula ip, Boolean tuple u. 
Output: Boolean tuple v, which is local 
minima of ip. 

Algorithm: 

while u is not a local minima 

pick uniformly at random variable Xj from 

the set of all variables 

if the number of clauses that can be made 
satisfied by flipping the value of Xi is strictly 
greater than the number of those made unsatisfied 
then set Uj = Ui 



(b) 

Figure 3: Straight Descent (a) and Modified Local Search (b) 

Lemma 13 If SD starts its work at a random vector with mo ones and after step t, t < n — uiq, it arrives 
to a vector with m ones, then this vector is selected uniformly at random from all vectors with m ones. 

Proof: Let us denote the probability that at step t SD arrives to vector u, conditional to it starts from a 
vector with mo ones, by P {u, t, mo). We prove by induction on t that P {ui , t, mo) = P (^2, t, mo) for any 
ui,U2 with m ones. We denote this number by P {t, m, mo). As the starting vector is random, it is obvious 
for t = 0. Then for t > 1 and any vector u with m ones we have 

P (u, t, mo) = F (u,t — 1, mo) • — + P (u', t — 1, mo) • — 

u' 

TTl TTl 

= P (t — 1, m, mo) • h P (t — 1, m — 1, mo) • — , 

n n 

where n is the number of variables in the formula and u' goes over all vectors that can be obtained from u 
by flipping a one into zero. It does not depend on a particular vector u. □ 

We will frequently use the following two properties of the algorithm SD. 

Lemma 14 Whp the running time ofSD does not exceed 2n In n. 

Proof: For a variable Xi the probability that it is not considered for t steps equals (l — ^)*. So for 
t = 2nlnn this probabihty equals < g-2inn _ ^-2^ Applying the union bound over all 

variables we obtain the required statement. □ 

Given 3-CNF ip and an assignment u we say that a variable xi is k-righteous if the number of clauses 
voting for it to be one is greater by at least k than the number of clauses voting for it to be zero. Let 
if G $P'^"*(n, gn) and ■u be a Boolean tuple. The ball of radius m with the center at u is the set of all tuples 
of the same length as u at Hamming distance at most m from u. Let f{n) and g{n) be arbitrary functions 
and d be an integer constant. We say that a set S of n-tuples is {g{n),d)-safe, if for any u e S the number 
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of variables that are not d-righteous does not exceed g{n). A run of SD is said to be {f {n) , g{n) , d)-safe if 
at each step of this run the ball of radius f{n) with the center at the current assignment is {g{n), (i)-safe. 

Lemma 15 Let q > k' ■ Inn for some k'. For any constants 7 and d there is a constant qi < 1 such that, 

for any a > ai, whp a run ofSD on if G <l>P'^"*(n, gn) is {'yn'^,n'^,d)-safe. 

Proof: Consider a run of SD on 99 € <l>P'^"*(n, gn) with a random initial assignment. If SD starts its 
work at a tuple with ttiq ones, then at step t it has m < niQ + t ones. Then by Lemma [13] if at step t the 
current assignment of SD has m ones then it is drawn uniformly at random from all vectors with m ones. 
Event Unsafe = "run of SD is not (771°, n°, d) -safe" is a union of events "at step t of SD's run the ball of 
radius 771° with the center at the current assignment is not (n", (i)-safe". We will use the union bound to 
show that probability of Unsafe is small. 

Let n be a Boolean n-tuple having pn positions filled with Is. Since whp the number of Is in the initial 
assignment is at least |, for every step the number of Is is at least |. Let M be an arbitrary set of variables 
with |M| = n". We consider events Bf}'^^ = "every variable Xj G M is not A;-righteous" and Bf:j = "the 
total number of votes given by clauses for variables in M to be 1 does not exceed the total number of votes 
given by clauses for variables in M to be plus \M\ ■ k." 

The same technique as in Lemma[6]can be used to show that the probability of Bl':} and consequently the 
probability of B'^f^ is bounded above by e^'^ for some constant A', not dependent on a. By inequal- 

ity (O, there are at most jn"' ■e^^^~°'^"'°' inn (i+o(i)) (jj^tj^ct assignments in the 7n" -neighborhood of SD and 
gn°(i-a) inn(i+o(i)) distinct subscts of sizc n°'. So for a close to 1 the union bound implies that Bf^^^ whp 
does not take place for any tuple, any subset of variables at any step which completes the proof of the 
lemma. □ 



For CNFs V'l , V'2 we denote by tpi A "02 their conjunction. 

We will need formulas that obtained from a random formula by adding some clauses in an 'adversarial' 
manner. Following [2T1 we call distributions for such formulas semi-random. However, the type of semi- 
random distributions we need is different from that in |[2l1l . Let r/ < 1 be some constant. A formula ip is 
sampled according to semi-random distribution 'I)^"'^^* (n, gn) if f = ip' Atp, where p' is sampled according 
to <I>P'^"*(n, gn) and contains at most n'' clauses and is given by an adversary. 

Corollary 16 Ifip' £ ^^'^^'^^{n, gn) then for any constants 7 and d there is a constant 02 < 1 such that for 
any a> a^a run of SD on ip' o ip is whp (7^", 2n", d)-safe. 

Proof: Let ai be obtained by application of Lemma [Tsl to p'. Let 02 = max(ai, r/). Then for a > 02 
whp run of SD on p' is (771", d)-safe. Since for n large enough tp contains less than n" variables run 
of SD will be (7^", 2n", d)-safe on p' Ax/j. □ 
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Lemma 17 Let (Dq , . . . , Di) be an integer random process, d > 0, and let L, H be integer constants such 
that 

(a) Do = 0,0<L< H, 

(b) \Dr+l-Dr\ = I, 

(c) if L < Dt < H the expectation of D^-^i conditional to satisfies the inequality E {Dt-^i\Dt-) < 
D-j- — d holds. 

j H-L 

Then the probability that there is r such that Dj^ > H is less than I ■ e 2 . 

Proof: We define a set of auxiliary processes d|: 

L, ifr<e, 

^ I D^, if (r > 0, {D^ = L) and {D^^ > L), for all C G . . . , r}), 

D(^- d{T - C), if T > ^, = L, and C G {Cj •••)''"} is the least such that < L, 
L — d{T — otherwise, i.e., ^ L and r > ^. 

The processes D^, . . . , D\. are designed so that every Dr for t > ^ satisfies inequality E ^^^+1 < 
Dr — d. Indeed, suppose that t > ^ L then 

E =L-d{T + l-i) = {L-{T-0-d = Dl-d. 

Let = L. If > L for all C{^, • • • , r} then = Dr, -D^+i = D^+i, and the result follows from the 
assumption E (-Dr+i \Dr) < — d. If there is ^ € {Ci • • • ) ''"} with Dq < L then 

E {dI^,\D^) = E {Di+,\DQ) =DQ-d{T + l-0 = (Dq - d{T - ()) - d = Df - d. 

By Azuma's inequality ^ for each ^ the probability of the event "there exists r such that d| = H" is 
less than e-(^-^)'^. 

On the other hand let D^ > L and be equal to the number of the most recent step for which D^ = L. 
It is easy to see that Dr = Dr. Thus if at some step Dr = H then there is ^ < r such that Dr = H. Using 
the union bound we get the required inequality. □ 



Lemma 18 Let q > n' ■ Inn for some k'. Let ip be a random 3-CNF sampled according to distribution 
^piant^^^ ^n) such that run of SD on tp is whp (71 n", 72^", l)-safe for some constants 71, 72 with 71 > 
372. Let Ud {m) , ui (m) denote the pair of assignments produced by the pair of processes ( SD,LS) on step m. 
For any t, whp the Hamming distance between Ud{t) and ui{t) does not exceed 71 n". 

Proof: Let Nt be the set of tuples at Hamming distance at most 7in" from Ud{t), and £ be event 
"ui{t) Nt for some t". LS starts with the same initial assignment as SD and we will show that it does not 
leave Nf. 

At some steps the distance between Ud{t) and ui{t) remains the same, and at some it changes. Let Ud, ui 
be the assignments produced by the algorithms after r changes have taken place, and Dr be the distance 
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between them. If 272?!" < D-r < 71 we have E {Dr+i \Dr) < Dr — \. Indeed, the number of variables 
voted to be zero does not exceed 72n" and is at least twice less than number of variables that differ in 
and ui{t). Since any change in the distance between the assignments happens if and only if a variable voted 
to be or a variable at which Ud{t) and ui{t) are different, we have the required inequality. Now we can 
apply Lemma[l7]for D setting L = 27271", H = 87271", d = 1/3 and get that probability of LS leaving Nt 
is less than £)ne~""/^. □ 



Corollary 19 For (/? € ^ (n, gn) there is a constant 03 such that distance between Ud{t) and ui{t) 
defined in Lemma\l8\whp does not exceed n"^. 

We say that a variable plays d-righteously in a run of LS if every time it is considered for flipping it is 
(i-righteous. Combining corollaries [16] and [19] we obtain the following 

Lemma 20 For any d there is a4 < 1 such that, for a run of LS on f ^ (n, gn) whp the number of 

variables that do not play d-righteously is bounded above by n"**. 

Proof: From Corollaries [16] and [19] it follows that whp at every step of LS the number of variables that 
are not d-righteous is less than ri", for some a. 

Therefore denoting the number of different assignments considered by LS by T (note that T < gn) 
and observing that at each step the probability to consider a variable voted to be is n"^^ we obtain the 
following upper bound for the expectation of the number of non-d-righteous variables throughout the run: 

for arbitrary e with a + 2e < 1. We apply Markov inequality and obtain P (/ > n"+^^) < ri^^, where / 
denotes the number of variables that do not play d-righteously. Now 04 can be set to be a + 2e. □ 



A clause {x, y, z) is called a cap support if there are wi , W2 such that {x,wi^W2, y, z) is a cap in ip. For 
a formula ip we denote the set of variables that occur in it by var(V'). For a set of clauses K we denote 
hy /\K a. CNF formula constructed by conjunction of the clauses. For the sake of simplicity we will write 
var(iir) instead of var (/\ K). In what follows it will be convenient to view a CNF as a sequence of clauses. 
Note that representation of a CNF is quite natural when we sample a random CNF by generating random 
clauses. This way every clause occupies certain position in the formula. For a set of positions P we denote 
the formula obtained from ip by removing all clauses except for occupying positions P hy ip [p. The set of 
variables occurring in the clauses in positions in P will be denoted by var(P). 

We denote by C the set of all possible clauses over n variables. Let us fix a real constant z/ < 1. We will 
need the following notation: 

• let [k] denote the set of the first k positions of clauses in cp, V be the set of all variables in ip; 

• let S"^''^ be the set of positions from [n'^] occupied by clauses that are cap supports in 99, and L^''^ the 
set of variables that occur in clauses in positions 5'^''^; 

• let T^'"^ be set of positions of ip occupied by clauses containing a variable from L^''^; 

• let U'^''^ be the set of positions in (p occupied by clauses containing a variable from var ( p j ; 
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• finally, let RP^" = [gn] \ (S^'^ U C/^''^); 

• let also WP'" = vaiiT^'") and = vaiiU'^'"). 
Fig. IHpictures the notation just introduced. 

m] mm unhm m3] gro wa ^ ibbs] ••• 



•ny first clauses 

Figure 4: A scheme of a 3-CNF. Every clause is shown as a rectangle with its hterals represented by squares 
inside the rectangle. Literals corresponding to variables from L'^"^ and from var are shown 

as diamonds and circles, respectively. Shaded rectangles with vertical and diagonal lines represent clauses 
from T't"' and V^" , respectively. 

Lemma 21 If p < kIuu and n < \ then there is po such that for any /i < /xq there is u < 1 such that whp: 

(1) \S'P^''\ ~ n^- 

(2) M^''^ n N'P''^ = 0, that is variables from clauses from V^'^ do not appear in the same clauses with 
variables from S'^''^; 

(3) \M'P''^\ = 3\T'P''^\, that is no variable occurs twice in the clauses from T^''^. 

Proof: It follows from Lemma |9] that for < /tlnn, k < | there exists a, < a < 1 such that the 
number of caps in the formula is ~ n". We set 

)Uo = a/2, V = p + 1 — a. 

(1) For a subset R of all positions of clauses in (j) let Cr denote event "R is exactly the set of positions 
occupied by cap supports". Obviously for any sets Ri,R2,\Ri\ = [i?2l we have P (C/j J = 'P {Cr^). 
Thus positions of the cap supports are selected uniformly at random without repetition. By straightforward 
computation we have expectation of the number of cap supports among first vy clauses equal approximately 
71° • ny~^ = = and variance is bounded above by the expectation, so it follows from 
Chebyshev inequality that random variable "number of cap supports among first -ny clauses" is whp ~ n^. 

(2) By Lemma|2l^2) whp there is no variable that occurs in more than In^ n clauses. Therefore \M^''^\ = 
0{n^^ \v? n) and \N'P'^\ = 0{'ny \p? n). These sets are randomly chosen from an n-element set, and there- 
fore the probability they have a common element is at most n^^'^^^ In^ n. Due to definition of // and v we 
have p. + v -I < a/2 + a/2 + l- a- l = 0. 

(3) Since whp |r'^''^| = 0(n'^ In^ n), the probability that two clauses from this set share a variable is 
bounded above by n^'^"^ In^ n. We have 2/x — l<a — l<Oso this probability tends to 0. □ 

Let us fix a formula ip selected accordingly ^>P'^"*(n, gn) and ^ < -g, and let v correspond to p as in 
LemmaEH Let Tq and Uq be subsets of [gn] such that Tq n C/q = 0, [n"] C Tq U f/o and let = Tq n [n!"]. 
We denote by i?ToC/o ^ hypothesis stating that is such that S"^'"" = Sq, T'P^" = Tq, W^'" = Uq and also 
M^'" n N'P'" = 0, [Ad-P'^l = 3 \T'P'''\. 
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Lemma 22 If for an event E there is a sequence 6{n) — > such that for all pairs (Tq, Uq), \Tq\JUq\ < 

n ► oo 

n"^" we have P {E\HtoUo) < H^) then P {E) — > 0. 

n > oo 

Proof: We can bound probability of event E as 

P(^) < E [^{E\Ht,Uo)^{Ht,Uo) 

To,C/o:|ToUC/o|<n2'' 

+P {M'f''' n N'f'" 7^ or IM'^'^'I < 3 \T'f'''\ or \Tq U Uq\ > n^")) 
< 6{n) + P (M^''' n N^^" 0) + P (IM^'H < 3 IT^'^) + P (|To U ^/qI > ra^'') • 

By Lemma El] probabilities of events M'P'" D N'^'" ^ and {M^P'^I < 3 |r'^'''| tend to as n approaches 
infinity. By Lemma[2](2) we have [To U C/o| < n^*^ whp. Thus we obtain the result. □ 



Observation 23 If is selected according to <I>P'^"*(n, gn) conditioned to HtqUq then formula 

^ i[H\{ToUC/o) 

has the same distribution as if it was generated by picking clauses from all clauses over variables 
V \ var([n'']) uniformly at random. 

Proof: Let C be the set of all clauses over variables in F \ var([n'^]) and Rq = [gn] \ (Tq U Uq). Take a 
formula such that positions from Rq of this formula are occupied by clauses from C . It suffices to observe 
that the number of formulas V'' such that ip' [r^= [r^, S'I'' = Sq, T'l''''' = Tq, f/^'-'^ = Uq is the same 
for any Tp. So since all possible formulas over variables from some set are equiprobable a random formula 
is generated by random sampling of clauses. □ 

Proof: [of Proposition [121 We will bound probability of success of Local Search under a hypothesis 
of the form HtqUq and apply Lemma [22] to get the result. Let a4 be the exponent corresponding to g by 
Lemma l20l and choose /i and v such that 04 + 2// < 1. 

Let M = M'^''^ and L = L^''^ . We split formula ip into ipi = ip [to and Lp2 = i[Qn]\To and 
first consider a run of LS applied to ip2 only. Formula ip2 can in turn be considered as the conjunction of 
^21 = 'P luo and ip22 = ip i[gn]\(ToUl/o)- Fig. [4] formula ipi consists of clauses shaded with vertical 
lines, formula (p2i of clauses shaded with diagonal lines and formula ip22 of clauses that are not shaded. By 
Observation [23]formula 9922 is sampled according to 

$Pi^*(n-5i(n),n^-(52(n)) 

modulo names of variables where 5i{n) and 62{n) are o(n). So formula ip2 is sampled according to 

^fiT^iri' - h{n),ng - 62(71)). 

By Lemma [20] the number of variables that do not play 2-righteously during run of LS on ip2 is bounded 
from above by n""' for a certain 03 < 1. 

We consider coupling {LS^, LS^^) of runs of LS on (p and p2, denoting assignments obtained by the 
runs of the algorithm at step t by u^{t) and u^^ (*) respectively. Let K be the set of those variables which 
do not belong to L (squares and circles in Fig. |4]l. Formula (p2 is a 3-CNF containing only variables from 
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K. For an assignment of values of all variables u we will denote by u\k its restriction onto variables from 
K. We make process LS^ start with a random assignment u^{0) = to all variables, and LS^^ with a 
random assignment ^,^2(0) — ^2 variables in K, such that ^^^Ik = Now the algorithms work as 
follows. At every step a random variable Xi is chosen. Process LSip makes its step, and process LS^2 makes 
its step if Xi G K. 

Whp LS^2 will run with at most n""* variables that do not play 2-righteously. Let W denote the set of 
such variables. Variables in formula ipi are selected uniformly at random so if 04 + 2/i < 1 then whp set 
M does not intersect with W. Hence, every time LS^p considers some variable from M it is 2-righteous 
in ip2 and belongs to at most one clause of ipi. Therefore such a variable is at least 1-righteous ip and is 
flipped to 1, or stays 1, whichever is to happen for LS^,^. Thus whp at every step of {LS^, LS^^) we have 
Uip{t)\K = u^2i't)- In the rest of the proof we consider only this highly probable case. 

Consider some cap support Cj = {xi,X4, x^) occupying a position i € [n'^] and such that xi = 0, X4 = 
1, X5 = at time 0, and a set Pc- of variables occurring in clauses that contain variables var(cj) (obviously 
var(cj) C P^-). Let cj be the clause that forms a cap with q. We say that a variable is discovered at step t if 
it is considered for the first time at step t. Let pi , . . . , be an ordering of elements of Pc^ according to the 
step of their discovery. In other words if variable pi is the first variable from Pc- that is discovered, was 
the last. In the case some variables are not considered at all, we place them in the end of the list in a random 
order. Observe that all variables that play at least 1-righteously are discovered at some step. All orderings 
of variables are equiprobable, hence, the probability of variables var(cj) to occupy places Pk-2,Pk-i and 
Pk equals 31/ k{k — l){k — 2). We will call this ordering unlucky. 

Let us consider what happens if the order of discovery of P^- is unlucky. All variables in Pc- \ var(ci) 
play 1-righteously, therefore once they are discovered by LS^p they equal to 1. Thus when xi,X4,X5 are 
finally considered all clauses they occur in are satisfied, except for Cj. So variables xi, X4, X5 do not change 
their values and the clause cj remains unsatisfied by the end of the work of LS^. 

By Lemma[2l;2) whp no vertex has degree greater than In^ n, so the size of the set Pc. is bounded above 
by 31n^ n. Thus the probability of event Unluck{i) ="order of discovery of var(ci) is unlucky" is greater 
than [^^- Thus, the expectation of \{i\U nluck{i)} \ equals 

l^ol _ 
In^ n In^ n 

Any variable whp occurs in clauses from T'^'^ at most once, hence there is no variable that occurs in the 
same clause with a variable from Cj^ and a variable from Ci^ for 11,12 G Sq, ii ^ 12- This implies that events 
of the form Unluck{i) are independent. Therefore random variable \{i\Unluck{i)}\ is Bernoulli and, as its 
expectation tends to infinity, the probability that it equals to goes to 0. Since unlucky ordering of at least 
one cap support leads to failure of the LS this proves the result. □ 
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